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Abstract 

In a variety of studies of dynamical systems, the edge of order and 
chaos has been singled out as a region of complexity. It was suggested 
by Wolfram, on the basis of qualitative behaviour of cellular automata, 
that the computational basis for modeling this region is the Universal 
Turing Machine. In this paper, following a suggestion of Crutchfield, 
we try to show that the Turing machine model may often be too 
powerful as a computational model to describe the boundary of order 
and chaos. In particular we study the region of the first accumulation 
of period doubling in unimodal and bimodal maps of the interval, from 
the point of view of language theory. We show that in relation to the 
"extended" Chomsky hierarchy, the relevant computational model in 
the unimodal case is the nested stack automaton or the related indexed 
languages, while the bimodal case is modeled by the linear bounded 
automaton or the related context-sensitive languages. 
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1. Introduction 

The complex systems that we often observe, both in nature and otherwise, 
are characteristically poised in a delicate balance between the dullness of 
order and the randomness of disorder. In recent years a lot of effort has been 
expended in obtaining quantitative measures which would provide suitable 
definitions for this complexity. 

In a study of the qualitative behaviour of cellular automata, Wolfram 
[|I|] noticed that there was a class of automata, which he called Class 4, 
whose members displayed complex dynamical behaviour. In particular, they 
seemed to exhibit transients which were arbitrarily long-lived. Wolfram sug- 
gested that this behaviour was reminiscent of the undecidability which char- 
acterised the halting problem for Turing machines. He conjectured that Class 
4 automata then might have a universal Turing machine embedded within 
them. 

The model of the Turing machine has served, in the past, to provide a 
deeper understanding of phenomena which had been investigated through 
other means. The most notable amongst these is the development of the 
idea of algorithmic information which is the computational counterpart of 
the traditional Shannon entropy. These ideas have served to provide a quan- 
titative basis for the qualitative notion of randomness. Conceived originally 
through the efforts of a number of people, it has shed light on some deep 
issues related to the foundations of mathematics, mainly through the work 
of Chaitin @. 

In || we tried to show that cellular automata, with Turing machines 
embedded within them, may not always display complex behaviour. In our 
present work we try to demonstrate that the model of the Turing machine 
may at times be too powerful to describe the computational complexity at 
the edge of order and chaos. Crutchfield j3j has argued that instead of looking 
only at the Turing machine, it might be wiser to look at the entire hierarchy 
of machines that computation theory provides us with. Classical compu- 
tation theory provides a beautiful framework in which models of machines 
or automata are related to the languages they recognise, which are in turn, 
generated by their respective grammars. This hierarchy is known as the 
Chomsky hierarchy. The Turing machines form the top of this hierarchy. 

The dynamical systems we study are, the rather well understood, iterated 
maps of the interval. We investigate the onset of chaos which is exhibited by 
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the first accumulation of period doubling for the case of the unimodal and 
bimodal families of maps. The symbolic dynamics and the kneading theory 
for these cases are well known [|, |5|. We demonstrate that the language 
generated by the kneading sequences in these two cases can be recognised by 
machines which lie lower down in the Chomsky hierarchy. 

2. The symbolic dynamics of Maps of the interval 

In this section we set up the notation used by Mackay and Tresser || to 
describe the onset of chaos at the first accumulation of period doubling. 

2.1. The unimodal case 

A unimodal map is, by definition, a continuous map / of the interval I — [0,1] 
into itself, which possess a single turning point c. Normally / is chosen to 
so that it monotonically increases in the interval [0, c) and decreases mono- 
tonically in the interval (c, 1] . With every point in / we associate a symbol 
L or R depending on whether it lies on the left or on the right of c. c is 
identified with the symbol C. In this way the orbit of any point in I un- 
der iteration by the map / can be associated with a sequence []of symbols 
from the set A = L, C, R. The kneading sequence which is defined to be the 
symbol-sequence corresponding to the orbit of the point /(c), is of particular 
importance in the symbolic dynamics. It has be shown, in the pioneering 
work of Milnor and Thurston |7|], that the kneading sequence controls the 
possible symbol-sequences that can occur for a given map and that it deter- 
mines important ergodic properties of the map, like the topological entropy. 

In order that we can define the symbol-sequences of interest to us, we 
need mention here only a single detail from symbolic dynamics. Symbolic 
sequences can be ordered in a way, such that they respect the ordering of the 
interval I i.e. if x, y e I have symbol-sequences s(x) and s(y) respectively, 
then 

x < y iff s(x) < s(y). 
This ordering is defined as follows: 

1 As is done conventionally, an eventually periodic sequence of symbols will be described 
as a finite sequences by enclosing the periodic part in brackets with the symbol "oo" at 
the end as in • • • (• • -) 00 
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First we define an ordering on the symbols as L < C < R. Now, if 
A = Xa . . . and B = Xb . . . are two sequences, for which X is the common 
prefix-sequence and a, b e {L, C,R}, a ^ b then 

A < B if a < b and X contains an even number of i?'s or 

a > b and X contains an odd number of i?'s. 
A > B otherwise 

It is well known that families of unimodal maps (like the well-known lo- 
gistic family) exhibit the route to chaos through period-doubling transitions. 
The renormalisation group theory pioneered by Feigenbaum || provides a 
beautiful description of this phenomenon. For our purposes it suffices to 
describe the kneading sequences that arise through successive applications 
of the renormalisation group transformation. The limit sequence then can 
be thought of as the symbolic description of the onset of chaos, obtained 
through an accumulation of period-doubling transformations. The algorithm 
for generating these sequences is as follows: 

= 

where the U^ n > and are chosen from {L, R} so that 

We mention that this algorithm generates the kneading sequences K n = 
(X( n 'C)°°,n > of maps, having a superstable periodic orbit of period 2 n . 
Moreover, the set of periods of periodic points of the map with the kneading 
sequence K n , is given by {1, 2,4,..., 2 n }. Each of these maps have a finite, 
stable attracting set and hence display ordered dynamics. 

We also define another set of kneading sequences which would be useful 
for the discussion 

K' n = (l(«)^)xW[/(«)x(») { /W)oo ) n > o 

The sequence K' n represents a map which contains an orbit of period 3 • 2 n . 
These maps have positive topological entropy and can, in this sense, be 
considered as displaying chaotic dynamical behaviour. 

The sequence = K'^, is the kneading sequence of the map at the 
onset of chaos. The set of periods, of the periodic points of this map is given 
by {1, 2, 4, . . . , 2 n , . . .}. The topological entropy in this case is zero. 

For the details of these results, we refer the reader to the literature 0, §]. 
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2.2. The bimodal case 



We now describe the accumulation of period-doubling in bimodal maps through 
their symbolic dynamics. Once again we refer the interested reader to for 
the details. 

A bimodal map of the interval J, is a map / from / into itself with 
two turning points, c and k. In what follows we consider the case of the 

H h maps, i.e. maps which are monotonic increasing on (0, c) and (k, 1), 

monotonic decreasing on (c,k). Each point in / is now associated with a 
symbol from the set {L, R, B} depending on whether it belongs to the interval 
(0, c), (c, k) or (k, 1) respectively. The points c and k are assigned the symbols 
C, K respectively. The symbol-sequence corresponding to the orbit of a point, 
consists of a string of symbols from the set S = {L, C, R, K, B}. In this case, 
the kneading data of a map corresponds to a pair of kneading sequences, 
which correspond to the symbol-sequences of the pair of points (/(c), f(k)). 
As mentioned before, the kneading data uniquely determine some important 
properties of the map. 

The definition of the ordering on symbol-sequences for the unimodal case 
from the previous section, can be retained verbatim for the bimodal case, if 
we define the ordering on the elementary symbols asL<C<R<K<B. 
The parity of common prefix of two symbol strings is still determined by 
the number of i?'s in it. This is due to the fact that R corresponds to the 
symbol for that portion of the interval /, on which / is monotonic decreasing. 
Thus defined, the ordering on symbol- sequences respects the ordering on the 
interval as before. 

We now give the description of the onset of chaos, corresponding to the 
first accumulation of period-doubling for bimodal maps. We define operations 
I and r on pairs (A, Y) of finite (possibly empty) sequences of {L, R, B} by 



where U, U are chosen from {R, B} so that XU < XK < XU and V, V are 
chosen from {L, R} so that YV < YC < YV . 

Given a finite sequence s of Z's and r's, we define 



(A, Y) I = (A, YVXUY) , (A, Y) r 



(XUYVX, Y) 



(1) 



(A s ,y s ) = (0,0)s 



(2) 



and write U 8 , U s , V s , V s for the U, U, V, V corresponding to A s , Y s 
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Pairs (P, Q) of sequences (possibly infinite) over the symbol-set £ can 
be put into a one-to-one correspondence with points in the unit square (see 
Figure which we denote by G. For (P , Qo) £ G, we define a wedge (see 
Figure 1(m)) 

w(P„, Qo) = {(P, Q) e G : P > P , Q < Q }. 

Then define 

R n = |J ^((X^nC) 00 , (F S KX S K)°°) for n>0 

\\s\\=n 

where ||s|| is the length of s. 

All maps on the boundaries of the wedges defining R n have a singly 
superstable, period 2 n+1 orbit passing through C or K. 

We also define regions S n ,n = 1,2,3..., such that S n contains maps 
which have certain period 3 • 2 n_1 orbits. The definitions of these S n is 
cumbersome. We reproduce it here for completeness. 

Si = u((RLR)°°, (LRR)°°) \Joo((BRR)°°, (RBR)°°) 

while for n > 2, 

S n = |J { io (((XUYVXUYVXUYV) S )°° , ({YV XUYV XUYV XU) : 

\\s\\=n— 2 



U oj {{(XUYV XUYV XUYV) s ) , {(YV XUYV XUYV XU) 
U w (((XUYV XUYV XUYV) s )°° , ((YV XUYV XUYV XU) 
U oo (((XUYV XUYV XUYV) s )°° , ((YV XUYV XUYV XU) S )°°)} 

where (ABC ...) s = (A S B S C S ...). 

The regions Pi and Si are shown in Figure l(m). 

With these definitions, following [§, we define subsets of G, which will 
serve as the regions of order and chaos. 

-Roo = [jRn = {J(G\Rn) 

n n 

Soo = U S n 

n 

D = GV^ooU^oo) 
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Theorem 1 of |J tells us, that the set of periods of a map contained in 
the region is given by {1, 2, 4, . . . , 2 n } for some n. Moreover all such 
maps are contained within it. Thus maps within this region exhibit ordered 
dynamics (and have zero topological entropy). 

On the other hand, a map contained in the region has at least one 
periodic orbit with a period, which is not a power of 2. Moreover all such 
maps belong to this region and have positive topological entropy. Hence, we 
consider Sqo as the region of chaos. 

The set of periods of a map in the boundary D is given by {1, 2, 4, . . . , 2 n , . . .}. 
Again, the converse is also true in this case. We will consider D as the 
"boundary" of order and chaos. Whether it is a boundary, in a precise topo- 
logical sense, of the regions .Roo and Soo is a long-standing conjecture. 

3. The "extended" Chomsky hierarchy 

The classical theory of automata and formal languages, has its roots, on 
the one hand, in the work of Turing, Church and others who developed the 
foundations of the theory of computation and on the other, in the work of 
formal linguists, notably Chomsky. In the present section we present a brief 
outline of this very large body of work, with an emphasis on that part which 
will be relevant for our purpose. For the details the reader is referred to the 
classic book by Hopcroft and Ullman ||. 

The entities that automata theory deals with are formal descriptions of 
devices which can perform "mechanical" tasks of varying degrees of difficulty 
These devices usually, have an input tape containing symbols from an alpha- 
bet, a memory unit and an auxiliary tape which is used to perform storage 
and output. The dynamics of the automaton is described by defining the ac- 
tion^) to be performed, depending on the symbol being currently read on the 
input tape, the present memory state and the current symbol on the storage 
tape. These actions might entail a change of the memory state, alteration of 
the current symbol on the storage tape and motion along the input tape in 
either direction. The Turing machine is probably the best known example of 
an automaton. The thesis of Church and Turing states that it is equivalent 
to any other system that can perform a task that can be defined algorith- 
mically. The central column in Figure 2 shows various classes of automata, 
that form a hierarchy of machines in terms of their computational power. 
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Formal languages are sets, containing finite strings (called words) of sym- 
bols from some (finite) alphabet. The most common examples of formal 
languages are the programming languages that are used today. These are 
contained in the class of formal languages called the context-free languages. 
One way of describing formal languages, is through the grammars that gen- 
erate them. A grammar contains, not only the symbols which form the 
words in the language that it generates (these are called terminal symbols), 
but also an auxiliary set of symbols, called the non-terminal symbols. The 
"productions" of the grammar, form the rules by which a given string of 
terminals and non-terminals, can generate another string of terminals and 
non-terminals. A special non-terminal (usually denoted by S) is designated 
the "start" symbol. Any string formed from the terminal symbols only, that 
can be obtained from the "start" symbol, by applying the productions suc- 
cessively, is a valid word in the language. The columns on the left and right 
in Figure 2, show a hierarchy of the classes of languages and their generative 
grammars. It is a beautiful fact of computation theory that the languages 
generated by the different grammars, are also "recognised" by the various 
classes of automata shown in Figure 2. 

This hierarchy of languages, grammars and machines goes under the name 
of the Chomsky hierarchy^. 

A very natural problem that arises in computation theory, is to determine 
the class in which a given language belongs. For this purpose, the theory 
provides a powerful set of results which are called intercalation theorems or 
sometimes pumping lemmas. These are necessary conditions for a language 
to belong to a particular class. The usual application of a pumping lemma is 
to exclude a given language from a certain class. Most commonly, the form 
of a pumping lemma is as follows: Given any word belonging to a language 
(in a certain class), it is possible to find a sub-word(s), of this word, of length 
not greater than k (an integer that depends only on the language and not the 
word) such that all the words obtained by "pumping" (i.e. adding another 
copy of the sub-word at its position in the given word) the sub-word finitely 
often, would all belong to the language. In the next section we use the 
pumping lemmas for the one-way, non-deterministic stacked languages and 

2 Conventionally the Chomsky hierarchy does not contain the indexed and the stacked 
languages and the automata that recognise them. We call this the "extended" Chomsky 
hierarchy. 
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the indexed languages. 



4. The results and proofs 

In this section we present the main results of this paper and their proofs. 
We first define formal languages corresponding to the onset of chaos in the 
unimodal and bimodal cases and then obtain the classes in the Chomsky 
hierarchy where these languages occur. The key ingredient in each proof is 
a relevant pumping lemma. As the statements of these lemmas are rather 
involved, we will use the lemma and refer the reader to the literature for its 
precise statement. 



4.1. The unimodal case 

In the notation of section 2.1, we introduce a sequence of languages A n = 
{K r : < r < n}. The words in these languages are formed from the 
alphabet A' = {L, C, R} U{(, )°°}- Moreover for each n, A n is finite and is 
contained in A n+ i. Thus each A n is a regular language. The limit language 
Aoo can be thought of as a symbolic description of the edge of order and chaos 
corresponding to the sequence of period-doublings of unimodal maps. Our 
concern here is to describe the place, that this language has in the "extended" 
Chomsky hierarchy. This is summarised by the following theorem, 



Theorem 4.1 

(i) Aoo is not a one-way, non-deterministic stack language. 

(ii) Aoo is an indexed language. 
Proof: 

(i) We use the pumping lemma for one-way, non-deterministic stack 



languages, which can be found in [1C]. Suppose is a one-way, non 



deterministic stacked language. Consider the integer k in Theorem 1 (hence- 



forth referred to as OG1) of fTO] . Let £ be a word in Aoo of length greater 
than k. The pumping lemma then guarantees the existence of a string of 
words £j, i — 1, 2, 3 . . . obtained by intercalating strings within £ as described 
in OG1. If ||£|| denotes the length of the word £, we have from OG1, for all 
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i > 0, 

ll&ll - II6-1II = INI + INI + INI - \Wi-i\\ 

= a + ib (3) 

where a is the sum of the lengths of all the <x,'s, 7/s, S^s, Xj% "^j's an d b is 
the sum of all /3j's that appear in the definitions of pi, <Ji and Tj. 
(0) is easily solved recursively, to give 

P{i) = \\^\ = \\^\\ + ai + bi{i-l)/2 (4) 

Now observe that for a word A in Aoo, || A|| = 2 m + 2 for some m > 0. Since 
each £j is in ^.qo we have 

P({) = 2 m + 2 (5) 

for every z > 0, for some m G N. That this is impossible is most easily seen 
as follows: 

P'(x) = P(x) — 2 is a polynomial taking integer values on the set of integers. 
Hence the set {P'(n) : n G N} has an infinite number of prime divisors (this 
is rather easy to prove). But from ([5]) P'(x) is contained in {2 n : n G N}. 
This leads to a contradiction and completes the proof. 



(ii) The simplest proof of the fact that Aoo in an indexed language is 
probably obtained by observing that Aoo is a 0L language (see |J pg. 390ff, 
for the definition) and hence is also an indexed language. We will, however, 
explicitly define the indexed grammar which generates Aoo- 

Consider the indexed grammar defined as: 
The non-terminal symbols are {S, T, X, U, U}, with S as the start symbol. 
The terminal symbols are {L,R,C, (, )°°}. The index-productions are (here 
e is the empty string) 



xf - 


-> xux 


Uf 


-»• u 


Uf 


-> u 


Xg - 


-> e 


Ug 


-> L 


Ug - 
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The productions are 



S — > (C)°° 

S — > (TyC) 

T — >■ T/ 

T — ► XUX 



The derivation of the words in Aoo using the indexed grammar proceeds 
as follows: 

As is conventional, A =>- B means that B is derived from A using a 
single production (or index production), while A 4> B means that there 
exist Ci, C 2 , . . . , C n such that A =>- C x =>• C 2 =>• ■ ■ ■ =>• C„ =>- R Here 
A,B,Ci are words formed from the terminal and non-terminal symbols 

5 (C)°° 

5 =>. (T^)°° =>- {XgUgXgC) 00 4> (iZC) 00 



4 



(77<?C) C 



(XfgUfgXfgCY 



JJ, * 



(XgUgXgUgXgUgXgCY 



(RLRC) C 



(Tff.-.fgCy 
n times 



4.2. The bimodal case 

We define the language which would describe the first accumulation of period- 
doubling for the bimodal maps. In the terminology of section 3b, we describe 
each wedge u>(P,Q) by the word (P : Q). The region R n is obtained as a 
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union of wedges, each of which corresponds to a unique finite string s of Vs 
and r's of length n. 

To describe the word corresponding to R n we first introduce the lexico- 
graphical ordering on strings s, induced by the ordering / < r on the symbols 
of s i.e. if Si = xa . . . and S2 — xb . . . are two strings on {I, r}, with a com- 
mon prefix- sequence x and a, b G {l,r},a ^ b, then Si < s 2 iff a < b, else 
Si > S2- Note that this ordering is different from the ordering on words we 
had defined in section 3.1. 

Remark (1): The words s can be generated as the nodes of a binary tree. 
The lexicographical ordering is then the conventional ordering of nodes from 
left to right of a binary tree (see Figure 3). We will refer to this tree as %. 

With this we now define the word corresponding to i?„ as follows: 

(a) Arrange the wedges occurring in the definition of R n , in increasing 
lexicographical order of the strings s that each wedge corresponds to. 

(b) Concatenate the words corresponding to each wedge in the same order 
to form the word corresponding to R n . 

In exactly the same fashion we can obtain the words which describe S n . Let 
w(R n ) and w(S n ) be the words which describe R n and S n respectively. 

Remark (2): The length of the word representing a wedge in the definition 
of R n is of length 2 n+2 + 7. This follows from the fact that if s has length n, 
then the lengths of X s and Y s add up to 2 n+1 — 2. Similarly the length of a 
word that describes a wedge in S n is 3 • 2 n + 7. In particular, note that the 
number of symbols that occur between two consecutive occurrences of the 
symbol "(" , in the definitions of R n and S n , is a (non-constant) function of 
n. 

We define a sequence of languages <Bj, i — 1, 2 . . . on the alphabet 
£' = £ (J {(, ), :, ;, (, )°°} 

as follows: 

Bi = {w(Rj): j = 1,2... i}. 
This language can be thought of as describing the region U}=i Ri- 
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As an example, we construct the language B\. 

X = 0, Y % = 0, U 9 = R, U® = B, V® = R, V % = L 
Xl = 0, Y x = LR, U l = R,U l = B,V l = L,V l = R 
X r = BR, Y r = 0, U r = B, U r = R, V r = R, V r = L 

w(Rl) = ((RLRC)°° : (LRLK) 00 }((BRBC) CO : (RBRK) 00 ) 
Bi = {((RLRC) 00 : (LRLK^dBRBC) 00 : (RBRK) 00 )} 

The limit language can be thought of as describing the "boundary" 
D of order and chaos. The following theorem characterises B^ with respect 
to the Chomsky hierarchy 



Theorem 4.2. 

(i) Boo is not an indexed language. 

(ii) Boo is a context-sensitive language. 
Proof: 

(i) The pumping lemma for indexed languages is given in ||11|| . Our proof 
closely follows Theorem 5.3 (henceforth referred to as HA5.3) of We 



urge the reader to refer to |TT| for a description of the notation that we use 
in this proof. 

Suppose Boo is an indexed language. Choose "(" as a special symbol of 
£'. With every word of an indexed language, we can associate a derivation 
tree (see fn| ). A node, p, of a derivation tree, 7, is said to be a P-node, if 
there exist at least two distinct sub-trees under it, each of which contains at 
least one node with the label "(". A pair of nodes pi,P2 of 7 are said to be 
CF-like if 

(a) p 2 is a descendent of p\ 

(b) pi and P2 have the same labels. 

(c) there exists a P-node p, such that p is a descendent of p\ and P2 is a 
descendent of p. 
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If 7 contains no CF-like pair of nodes, it is said to be non-CF-like. 

We now show that if ip is a word in Boo of large enough length, then parts 
of it can be intercalated in a way such that the resulting words would not 
belong to Boo- Choose ip to be any word in which the number of occurrences 
of the symbol "(" is more than k', where k! is an integer which depends only 
on B^ (and is defined in HA5.3). Say 7 is the derivation tree of ip. We 
consider two cases. 

Case(a): 7 is non-CF-like. 

In this case, following the proof of HA5.3 we can establish that there is 
a decomposition 7 = a ■ f3 ■ 5 ■ r • v such that either a or v has at least three 
P- nodes. Then by intercalating parts of 7 we obtain a sequence of trees 8 n , 
n = 1, 2, 3 . . . each of which has a and v as its (first and last respectively) 
components. Moreover the word g(6 n ) of which 6 n is a derivation tree, be- 
longs to Boo and ||(7(0n)|| < Hs^n+i)!!- This means that there exist words 
of increasing length in Boo such that each of them contains a fixed sub-word 
of the form (■ • • ( (because either a or v has at least three P-nodes). This 
contradicts Remark (2). 

Case(b): 7 has a CF-like pair of nodes. 

In this case the proof of lemma 2.1 of |Tl| guarantees a decomposition 7 = 



a ■ (3 ■ 5 such that (3 contains a P-node. Moreover for each n = 1, 2, 3 . . ., there 
exists a sequence of trees 

7„ = a ■ P ■ (3- ■■ (3 -5 
n times 

such that each g{j n ) G Boo and ||g(7n)|| < ||fi l (7n+i)||- This implies that for 
every n > 1, g(7 n ) contains a sub-word of the form (•••(•• ■) n (Note that by 
•••(••• we mean a fixed sub- word of that form). Since the 7 n 's belong to Boo 
we have a contradiction with Remark (2). 



(ii) We prove that Boo is a context-sensitive language by showing that it in 
fact belongs to the complexity class DSPACE(n). The proof that the class of 
context-sensitive languages is equivalent to the complexity class NSPACE(rz) 
(which contains DSPACE(n)), can be found in JJ. 

To show that a language is DSPACE(n), we need to show that the words 
in the language can be recognised by an off-line, multitape Turing machine 
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such that the size of every tape (including the read-only tape) is limited to be 
of the size of the input word (buffered on either side by end-marker symbols) . 
We will informally describe the steps in the algorithm required to recognise 
the input string. It will be clear from our description that each subroutine in 
this algorithm can be implemented on a bounded tape (or sometimes a pair of 
tapes) of our machine. The number of subroutines involved in the description 
will thus determine the number of tapes of the machine. We buffer each tape 
of the machine with end-marker symbols. If for a given input string, the head 
on any output-tape reaches an end-marker, the machine halts in a non-final 
state and thus fails to accept the string. It will be clear that for a string in 
Boo this will never happen. 

In what follows we denote the tapes of the Turing machine by ti,i = 
0, . . . , N. to is the input-tape which is read-only. Auxiliary tapes necessary 
to perform computations at Step(i) will be denoted as t ai) tu etc. 

Step(l): Determine n for the given input word i/j. 

We wish to place n l's on t\ to determine the depth of the binary tree 
(see Remark (1)) where ip could occur. This is easily possible using Remark 
(2). Since the first wedge?] in any word of B^ has length 2 n+2 + 7 for some 
n, we can use two auxiliary tapes t i,t&i to evaluate n. 

Step(2): Determine the possible strings s on {I, r} of length n, in lexico- 
graphical order. 

We now want to obtain the nodes of the tree T s (see Remark (1)) at 
depth n. These nodes will be written on t 2 i n the lexicographical order from 
left to right and will be separated by commas. To achieve this we need two 
auxiliary tapes t a2 ,tb 2 . For every 1 encountered on t 1 , the tapes t a2 and 
can be used alternately to generate the nodes of T s at successive depths by 
copying and prefixing. This is continued till the first blank is encountered on 
ti, when the process stops and the contents of the last auxiliary tape written 
on, are copied to t 2 . As the formal description of the entire procedure is 
cumbersome, we leave it to the reader to check it. It is easy to see that the 
limited size of the tapes available, in fact, suffice for this purpose. 

Step(3)-(8): Determine X s , Y s , U s , U s , V s , V s for an s on t 2 

For an s that has been written on tape t 2 , we wish to determine the corre- 

3 In what follows we will use the terms "wedge" and "the word representing the wedge" 
interchangeably when there is no danger of confusion. 
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sponding X, Y, U, U, V, V. These will be stored on tapes t% to t$ respectively. 
As the process is recursive we can use auxiliary tapes t ai , tui = 3, . . . , 8 for 
each main storage tape. While the determination of X, Y requires mere copy- 
ing from one tape to another, the determination of U, U, V, V will require the 
determination of the parity of the number of i?'s in X, Y. 

Step(9): Determine the word corresponding to R n in B^. 

For each s the wedges that occur in the definition of R n are easily obtained 
by copying the X, Y, U, U, V, V in the relevant order to tg. Control then 
returns to Steps(3)-(8) where X, Y, U, U, V, V corresponding to the next s 
are obtained and so forth. 

After this the word in t 9 is compared symbol-by- symbol with that in to. 
If it is the same the machine halts in a final state or else it halts in a non-final 
state. 

We observe that if ip e B^, then each step in the process described above, 
could be carried out on the bounded set of tapes that was available. Thus if 
during any of these steps, a head reaches the end of a particular storage-tape, 
we are guaranteed that ip ^ B^ and the machine would halt in a non-final 
state, rejecting the word. This completes the proof. 

5. Approaching the Onset Of Chaos 

Let us first consider the description of the onset of chaos in the unimodal case. 
It might seem at first sight that the description of this set by means of the 
language Aqo as being rather ad hoc. In fact instead of choosing to approach 
the accumulation point through a sequence of superstable bifurcations, we 
might as well have chosen any of the other sequences available. However, it 
is easy to see that the kneading sequences within a periodic window are very 
simply related to each other. We could in fact have included every kneading 
sequence less than Xoo to form a new language Coo- This new language would 
be described simply as 

where (A^) denotes the language containing the kneading sequences to 
the left (resp. right) of the superstable sequences in each periodic window. 
A proof almost identical to the one given for Aoo however shows that both 
A^ and A^ (and hence Coo) are indexed languages. 
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A more pertinent question is what happens if we are to approach the on- 
set of chaos from the chaotic side. This question is rather tricky. For the sake 
of discussion let us fix the family of maps to be the logistic family, described 
by the equation f(x) = fix(l — x). It will be clear that everything would go 
through for a much larger class of maps as well. We could approach the accu- 
mulation point Hoc from the right through a sequence of band-merging points. 
The kneading sequences at these points are most conveniently described by 
the so called *-operation, originally discovered by Derrida, Pomeau and Ger- 
vois [12]. We give here the definition which best suits our purpose. Given 
a word K" on the symbols {L,C,R}, R * K" is obtained by applying the 
following transformations simultaneously on every symbol of K": 

L^RR 
R^ RL 
C ^RC 

We begin with the region of well- developed chaos at fi — 4. The kneading 
sequence at this point is K" = R(L)°°. The band-merging points are then 
obtained through successive applications of the ^-operator as 

K» = R*K'{, 
K'i = R*R*K'{, 

K = {R*) n Kl 



Now consider the language A!^ = {K£ : n = 1,2, . . .}. We could think of 
this language as another description of the edge of chaos, this time from the 
region greater than /Xqq. In fact the definition of the *-operator immediately 
confirms that this is a OL language and hence is also an indexed language 
(see 0). Note that we could have discussed the languages Aoo and A'^ using 
the *-operation as well (see, for example, []5J). 

We could now ask the same question that we asked before. What about 
other descriptions of the onset of chaos from the chaotic part of the spectrum? 
We do not at present have any good answer to this question. As a preliminary 
observation we might note that we could have chosen to approach the onset 
of chaos through the kneading sequences K' n described in section 2.1 and 
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could have described yet another language A'^ = {K' n : n = 1, 2, 3 . . .}. Of 
course this language is also an indexed language. 

However, there is in fact a crucial difference between the language Aqo 
and the languages A'^ or or A'^, which, though obvious, might be well worth 
pointing out: each word in Aoo describes a stable periodic point, whereas 
that in describes an unstable periodic point. In the chaotic regime the 
attracting sets would be described by aperiodic symbol-sequences which are 
infinitely long. Unfortunately, classical computation theory does not consider 
within its domain, languages whose words might be infinitely long. In fact 
the behaviour of classical computational devices on words of infinite length 
is very different from their behaviour on words of finite length (see ||13|| ). 
We speculate that a description of computation over the field of reals, when 
sufficiently elaborate so as to provide an analogue of the Chomsky hierarchy 
might throw light upon these issues. The recent developments in this direc- 
tion due to Blum, Shub and Smale |14| might provide the seeds of the such 
a theory. 

The drawbacks mentioned above also apply to our discussion of the bi- 
modal maps. We could as in the unimodal case, choose to approach the edge 
from the chaotic side. In the notation of sections 2.2 and 4.2, the language 
^oo = i w (Sn) '■ n — 1,2,3. . .} can be considered as describing the region 
D, thought of as the boundary of the region S^. The proof given in sec- 
tion 4.2 goes through almost unaltered even for this case. Thus B'^ is a 
context-sensitive language. 



6. On complex descriptions. 

In conclusion we would like to place our results in perspective. What paradigm 
do these results suggest for a definition of complexity? In order to address 
this question we must first inquire, into the process of description. The scien- 
tific description of phenomena normally involves two aspects. The first is the 
specification of the model-class. The second aspect involves the description 
of the phenomenon at hand, with respect to this model-class. Let us call 
this process, interpretation. For a description to be "useful" we must ensure 
that that both the model-class and the interpretation have been specified in 
finite terms. Given this rather simplistic picture of the modeling process we 
now ask, how complexity arises or, more specifically, why are some phenom- 
ena more complex than others? Consider a phenomenon which resists finite 
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interpretation with respect to a certain model class. In order to describe 
it we would, then, have to construct a "larger" model-class. This could be 
regarded as signalling complexity. 

To illustrate this in the context of our results, consider the automata or 
equivalently the grammars in the Chomsky hierarchy as representing model- 
classes. Let the behaviour of maps (unimodal or bimodal) represent the entire 
class of phenomena to be described. The symbolic dynamics, giving rise to 
a language, and the explicit construction of a grammar (corresponding to a 
given model-class) generating that language, at each value in the parameter 
space, can serve as an interpretation for the phenomenon. Together these 
constitute a description of the behaviour of the map in question. Now, at 
the onset of chaos we observe that we are forced to change our model-class (in 
the unimodal case, for example, from the regular grammars to the indexed 
grammars). In fact in our case we have proved, that no finite interpretation 
can be obtained of the edge of chaos in terms of the older model-class. This 
describes the complexity at the onset of chaos. 

Finally we would like to point out that unlike conventional statistical- 
mechanics that relies on numerical or quantitative classes (say using critical 
exponents), the paradigm suggested above favours a more qualitative defini- 
tion of complexity. What one is tempted to observe, is that, complexity is 
not so much a matter of number, as it is of mechanism. 
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Note added: After most of this work was completed, we discovered that 



Crutchfield and Young [|15|, |1| have also shown that the formal language 
corresponding to the unimodal case is an indexed language. Their approach, 



based on the e-machine reconstruction, is more general but different from 
the one (based on symbolic dynamics) pursued in this paper. 
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Figure 1: («) The space G of pairs (P,Q) of sequences on {L, C, R, K, B}. (n) The 
wedge u>(Po, Qo)- ( M The regions G\Ri and Si. Taken from Mackey and Tresser[6]. 




Figure 2: The "extended" Chomsky Hierarchy. The single-headed arrows denote strict 
inclusion. The double-headed arrows denote an eqi valence. The central column con- 
sists of the hierarchy of automata, the left column of the hierarchy of generative gram- 
mars and the right column of the hierarchy of formal languages. In the conventional 
Chomsky Hierarchy, the stack and the indexed languages do not occur. 




Figure 3: The tree T s . 



